[SharedOffloadRegion] Align blocks to page-size by varun-sundar-rabindranath · Pull Request #43689 · vllm-project/vllm

varun-sundar-rabindranath · 2026-05-26T16:38:37Z

Purpose

Align blocks in SharedOffloadRegion to page_size so that O_DIRECT succeeds.

Changes:

Update CPUOffloadingSpec to account for alignment
Update SharedOffloadRegion to compute aligned row_strides
Update test_fs_tier.py to use SharedOffloadRegion, which makes the tests more robust and exercises the interplay between fs_tier and SharedOffloadRegion

Interface change:

The CPUOffloadingSpec constructor takes a block_size_alignment integer that defaults to 1
SharedOffloadRegion directly consumes the user-specified cpu_bytes_to_use
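To illustrate the alignment idea, here is a minimal sketch of rounding a block size up to the page size so that every row of a buffer starts on a page boundary. The helper names `round_up` and `aligned_row_stride` are hypothetical and are not taken from the PR; the actual computation in SharedOffloadRegion may differ.

```python
import mmap


def round_up(value: int, multiple: int) -> int:
    """Round value up to the nearest multiple of `multiple`."""
    return ((value + multiple - 1) // multiple) * multiple


def aligned_row_stride(block_bytes: int, alignment: int = mmap.PAGESIZE) -> int:
    # Hypothetical helper: pad each row so that the next row starts
    # on an offset that is a multiple of `alignment`.
    return round_up(block_bytes, alignment)


# A 5000-byte block with 4096-byte alignment occupies two pages per row.
stride = aligned_row_stride(5000, alignment=4096)
offsets = [i * stride for i in range(3)]
```

With a stride that is a multiple of the alignment, every row offset is aligned as long as the base of the buffer is aligned too.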

Test Plan

Run pytest -s tests/v1/kv_offload/test_fs_tier.py multiple times locally.

Test Result

Test passes.

orozery · 2026-05-26T17:12:45Z

Thanks @varun-sundar-rabindranath !
This fixes the test, but leaves the possible underlying issue.
Once we fix the underlying issue, this test would actually prove we fixed it.

varun-sundar-rabindranath · 2026-05-26T17:43:09Z

Thanks @varun-sundar-rabindranath ! This fixes the test, but leaves the possible underlying issue. Once we fix the underlying issue, this test would actually prove we fixed it.

Thanks for taking a look @orozery. The fixes address the root cause of the test failure. The current set of tests don't invoke the SharedOffloadRegion. These are targeted tests that test the fs_tier directly.

IMO making sure that the CPU backing tensor is always aligned should be a separate test. What do you think?

varun-sundar-rabindranath · 2026-05-27T04:38:03Z

Hi @orozery, I have updated the tests to use SharedOffloadRegion directly for better coverage, and have updated SharedOffloadRegion to always align its rows to page-size boundaries. PTAL! Thanks.

orozery

Thanks @varun-sundar-rabindranath !
Can you please change the PR title and description to reflect it aligns CPU pages?

orozery · 2026-05-27T06:27:06Z

+        self.page_size = mmap.PAGESIZE
+        self.num_blocks = num_blocks
+        self.total_size_bytes, self._row_stride = self._maybe_update_buffer_size(


This potentially violates the user's cpu_bytes_to_use, allocating more than the user allowed.
I think we want to add an alignment classvar in cpu/spec.py, which will be overridden in tiering/spec.py:

# CPUOffloadingSpec
class CPUOffloadingSpec(OffloadingSpec):
    CPU_PAGE_SIZE_ALIGNMENT = 1

    def __init__(self, vllm_config, kv_cache_config):
        ...
        kv_bytes_per_offloaded_block = kv_bytes_per_block * self.block_size_factor
        self.cpu_page_size_per_worker = round_up(
            kv_bytes_per_offloaded_block // world_size,
            self.CPU_PAGE_SIZE_ALIGNMENT,
        )
        self.num_blocks = (
            int(cpu_bytes_to_use) // (self.cpu_page_size_per_worker * world_size)
            if self.cpu_page_size_per_worker > 0
            else 0
        )
        ...

# TieringOffloadingSpec
class TieringOffloadingSpec(CPUOffloadingSpec):
    CPU_PAGE_SIZE_ALIGNMENT = SharedOffloadRegion.PAGE_SIZE_ALIGNMENT
    ...
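The effect of this suggestion on the block count can be seen in a self-contained sketch. The classes below are simplified stand-ins with hypothetical names (`CPUSpecSketch`, `TieringSpecSketch`) and a reduced constructor; they do not depend on vLLM, and the value 4096 is only an assumed alignment.

```python
def round_up(value: int, multiple: int) -> int:
    """Round value up to the nearest multiple of `multiple`."""
    return ((value + multiple - 1) // multiple) * multiple


class CPUSpecSketch:
    # Default alignment of 1 means "no padding".
    CPU_PAGE_SIZE_ALIGNMENT = 1

    def __init__(self, cpu_bytes_to_use: int,
                 kv_bytes_per_offloaded_block: int, world_size: int):
        # Pad each worker's page up to the class-level alignment.
        self.cpu_page_size_per_worker = round_up(
            kv_bytes_per_offloaded_block // world_size,
            self.CPU_PAGE_SIZE_ALIGNMENT,
        )
        bytes_per_block = self.cpu_page_size_per_worker * world_size
        # Fewer blocks fit when pages are padded, so the budget is respected.
        self.num_blocks = (
            int(cpu_bytes_to_use) // bytes_per_block if bytes_per_block > 0 else 0
        )


class TieringSpecSketch(CPUSpecSketch):
    # Assumed alignment, chosen only for this illustration.
    CPU_PAGE_SIZE_ALIGNMENT = 4096


plain = CPUSpecSketch(1_000_000, 10_000, world_size=2)
tiered = TieringSpecSketch(1_000_000, 10_000, world_size=2)
```

In this sketch the padded variant fits fewer blocks into the same budget instead of allocating beyond it.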

Nice catch. I agree that cpu_bytes_to_use should be respected.

Concern:
Expanding each individual cpu_page_size_per_worker looks like it will break some invariants and introduce hard-to-catch bugs. That is, we are going from

B0 |<--- B0 W0 ---><--- B0 W1 ---><--- B0 W2 --->|
B1 |<--- B1 W0 ---><--- B1 W1 ---><--- B1 W2 --->|
B2 |<--- B2 W0 ---><--- B2 W1 ---><--- B2 W2 --->|
...
where Bi - Block i ; Wj - Worker j

to

B0 |<--- B0 W0 ---***pad***><--- B0 W1 ---***pad***><--- B0 W2 ---***pad***>|
B1 |<--- B1 W0 ---***pad***><--- B1 W1 ---***pad***><--- B1 W2 ---***pad***>|
B2 |<--- B2 W0 ---***pad***><--- B2 W1 ---***pad***><--- B2 W2 ---***pad***>|
...
where Bi - Block i ; Wj - Worker j

One example is the assert in the CPU <-> GPU transfer:

vllm/vllm/v1/kv_offload/cpu/gpu_worker.py

Line 154 in 52a31cc

assert cpu_page_size == gpu_page_size * block_size_factor

This padding will have to be plumbed through and handled correctly.

Instead I propose doing,

B0 |<--- B0 W0 ---><--- B0 W1 ---><--- B0 W2 --->***pad***|
B1 |<--- B1 W0 ---><--- B1 W1 ---><--- B1 W2 --->***pad***|
B2 |<--- B2 W0 ---><--- B2 W1 ---><--- B2 W2 --->***pad***|
...

which is a looser constraint and can be handled directly in SharedOffloadRegion. Respecting cpu_bytes_to_use can be handled by:

Allocating fewer num_blocks in CPUOffloadingSpec when padding is involved (communicated via the CPU_PAGE_SIZE_ALIGNMENT classvar or a constructor arg)

Passing cpu_bytes_to_use directly to SharedOffloadRegion

Introducing padding in SharedOffloadRegion that respects both the alignment and cpu_bytes_to_use
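The per-block padding proposal can be sketched as follows. The functions `plan_layout` and `worker_offset` are hypothetical names used only for illustration, and the numbers are made up; the point is that each worker's slice keeps its unpadded size while every block starts on an aligned offset.

```python
def round_up(value: int, multiple: int) -> int:
    """Round value up to the nearest multiple of `multiple`."""
    return ((value + multiple - 1) // multiple) * multiple


def plan_layout(cpu_bytes_to_use: int, bytes_per_worker: int,
                world_size: int, alignment: int) -> tuple[int, int]:
    """Return (num_blocks, block_stride) with padding only at the end of a block."""
    unpadded_block = bytes_per_worker * world_size
    block_stride = round_up(unpadded_block, alignment)
    # Allocate fewer blocks rather than exceed the user's byte budget.
    num_blocks = cpu_bytes_to_use // block_stride if block_stride > 0 else 0
    return num_blocks, block_stride


def worker_offset(block: int, worker: int,
                  bytes_per_worker: int, block_stride: int) -> int:
    # Worker slices stay contiguous and unpadded inside a block.
    return block * block_stride + worker * bytes_per_worker


num_blocks, block_stride = plan_layout(
    cpu_bytes_to_use=100_000, bytes_per_worker=5_000, world_size=3, alignment=4096
)
```

Because the per-worker size is unchanged, a check such as cpu_page_size == gpu_page_size * block_size_factor would not be affected by the padding in this layout.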

orozery · 2026-05-27T06:31:50Z

+    region, tensor, mock_view = _make_region_tensor_and_view(
+        num_blocks=4,
+        block_elements=_BLOCK_ELEMENTS,
+        instance_prefix="test-fs-tier",
+    )


Assuming we move the alignment code further up to spec.py, let's go back to a simple tensor allocation here, but using an aligned page size.

Keeping this as the new set of changes still updates SharedOffloadRegion directly. PTAL. Thanks 🙌

orozery · 2026-05-28T10:50:05Z

+            assert self.cpu_bytes_to_use >= aligned_kv_bytes_per_offloaded_block, (
+                f"CPU space insufficient for offloading. {self.cpu_bytes_to_use=} "
+                f"{kv_bytes_per_offloaded_block=} "
+                f"{aligned_kv_bytes_per_offloaded_block=} "
+                f"{self.block_size_alignment=}"
+            )


Why do we need this assert?

This assert makes sure that num_blocks is not 0. When it fires, it tells the user to increase cpu_bytes_to_use in order to run with CPU offloading enabled.
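As a small worked example of the case the assert guards against: integer division yields zero blocks whenever the budget is smaller than one aligned block. The function names below are hypothetical, and raising ValueError is just one way to surface the problem.

```python
def num_offload_blocks(cpu_bytes_to_use: int, aligned_block_bytes: int) -> int:
    """Number of whole aligned blocks that fit in the budget."""
    if aligned_block_bytes <= 0:
        raise ValueError("aligned_block_bytes must be positive")
    return int(cpu_bytes_to_use) // aligned_block_bytes


def require_capacity(cpu_bytes_to_use: int, aligned_block_bytes: int) -> int:
    # Hypothetical defensive check: fail early with a clear message
    # instead of letting a zero-sized allocation fail later.
    blocks = num_offload_blocks(cpu_bytes_to_use, aligned_block_bytes)
    if blocks == 0:
        raise ValueError(
            f"CPU space insufficient for offloading: {cpu_bytes_to_use=} "
            f"{aligned_block_bytes=}"
        )
    return blocks


too_small = num_offload_blocks(8_191, 8_192)
just_enough = require_capacity(8_192, 8_192)
```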

orozery · 2026-05-28T10:50:24Z

+        self,
+        vllm_config: VllmConfig,
+        kv_cache_config: KVCacheConfig,
+        block_size_alignment: int = 1,


Let's use a classvar instead of introducing a new init param

Updated to introduce a classvar.
I am a bit uncomfortable with the semantic difference between CPUOffloadingSpec.BLOCK_SIZE_ALIGNMENT and self.BLOCK_SIZE_ALIGNMENT; it seems to increase the surface for bugs. What do you think?
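The semantic difference mentioned here is standard Python attribute lookup, shown below with hypothetical stand-in classes (`BaseSpec`, `TieredSpec`) rather than the real vLLM classes. Accessing the classvar through `self` picks up a subclass override, while naming the base class explicitly does not.

```python
class BaseSpec:
    BLOCK_SIZE_ALIGNMENT = 1

    def alignment_via_self(self) -> int:
        # Resolved on the instance's class first, so overrides are seen.
        return self.BLOCK_SIZE_ALIGNMENT

    def alignment_via_class(self) -> int:
        # Always reads the base class value, ignoring subclass overrides.
        return BaseSpec.BLOCK_SIZE_ALIGNMENT


class TieredSpec(BaseSpec):
    # Assumed override value, chosen only for this illustration.
    BLOCK_SIZE_ALIGNMENT = 4096


spec = TieredSpec()
seen_via_self = spec.alignment_via_self()
seen_via_class = spec.alignment_via_class()
```

Mixing the two access styles inside one class hierarchy is where the two values can silently diverge.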

mergify · 2026-05-28T13:59:15Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @varun-sundar-rabindranath.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

orozery · 2026-05-28T15:44:13Z

+            assert int(cpu_bytes_to_use) >= self.kv_bytes_per_offloaded_block_pad, (
+                f"CPU space insufficient for offloading. {cpu_bytes_to_use=} "
+                f"{self.kv_bytes_per_offloaded_block=} "
+                f"{self.kv_bytes_per_offloaded_block_pad=} "
+                f"{self.BLOCK_SIZE_ALIGNMENT=}"
+            )


Same as the previous review: Why do we need this assert?

I replied in the previous review - #43689 (comment)

IMO this is a bit far-fetched.
And if we have a single block, what is it good for?

I have reverted the assert.
However, I believe it is better to be defensive in this case. For example, I think having zero num_blocks will cause mmap to fail in SharedOffloadRegion. What do you think? Maybe we should handle it elsewhere.

orozery · 2026-05-30T17:30:28Z

@varun-sundar-rabindranath can you please address the remaining nits?

varun-sundar-rabindranath · 2026-05-31T17:05:30Z

Hi @orozery I have addressed the comments. PTAL! thanks 🙌

orozery

Thanks @varun-sundar-rabindranath !

orozery · 2026-06-02T14:22:53Z

@varun-sundar-rabindranath test failing:
https://buildkite.com/vllm/ci/builds/69346#019e8591-14d4-4f18-ac27-d3afcd3030d0

Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com>

Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com> Co-authored-by: varun sundar rabindranath <vsundarr@redhat.com> Signed-off-by: Matt Van Horn <455140+mvanhorn@users.noreply.github.com>

Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com> Co-authored-by: varun sundar rabindranath <vsundarr@redhat.com> Signed-off-by: JisoLya <523420504@qq.com>

varun-sundar-rabindranath requested review from ApostaC and orozery as code owners May 26, 2026 16:38

varun-sundar-rabindranath mentioned this pull request May 26, 2026

[BugFix] FS Offloading: Fallback from O_DIRECT #43674

Closed

mergify Bot added the v1 label May 26, 2026

orozery requested changes May 27, 2026

varun-sundar-rabindranath changed the title ~~Fix test_fs_tier.py~~ [SharedOffloadRegion] Align blocks to page-size May 27, 2026

varun-sundar-rabindranath requested a review from orozery May 27, 2026 15:57

orozery requested changes May 28, 2026

mergify Bot added the needs-rebase label May 28, 2026

varun-sundar-rabindranath force-pushed the varun/align-test-tensor branch from 38b0fd3 to 9c8baee Compare May 28, 2026 15:21

mergify Bot removed the needs-rebase label May 28, 2026

varun-sundar-rabindranath requested a review from orozery May 28, 2026 15:27

orozery requested changes May 28, 2026

varun-sundar-rabindranath requested a review from orozery May 28, 2026 16:24

orozery mentioned this pull request May 30, 2026

[Bugfix] Fall back to buffered I/O if O_DIRECT fails in secondary tier fs #44016

Closed

orozery reviewed May 31, 2026

Comment thread tests/v1/kv_offload/test_fs_tier.py Outdated

Comment thread tests/v1/kv_offload/test_fs_tier.py Outdated

orozery reviewed Jun 1, 2026

Comment thread vllm/v1/kv_offload/cpu/shared_offload_region.py

varun-sundar-rabindranath requested a review from orozery June 1, 2026 06:01

orozery approved these changes Jun 1, 2026

orozery added the ready ONLY add when PR is ready to merge/full CI is needed label Jun 1, 2026

varun-sundar-rabindranath force-pushed the varun/align-test-tensor branch 2 times, most recently from 7773cf1 to 8414e03 Compare June 1, 2026 23:40

varun-sundar-rabindranath force-pushed the varun/align-test-tensor branch from 8414e03 to 562e361 Compare June 2, 2026 15:07

varun sundar rabindranath added 2 commits June 3, 2026 04:06

align test tensor

45eab57

Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com>

respect shared offload region's BSA

0b2a808

Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com>

varun-sundar-rabindranath force-pushed the varun/align-test-tensor branch from 562e361 to 0b2a808 Compare June 3, 2026 08:14

orozery merged commit 3d76f39 into vllm-project:main Jun 3, 2026
48 checks passed
